
Vision MCP Server | Image Analysis MCP

@Markusbetter

This MCP server works around the visual recognition limits of text-only models: it identifies and describes images accurately, which makes it a good way to show an AI assistant a reference design or interface. It currently supports two input methods: dropping an image link into the chat box, or placing the image in the project folder.

It can be added to MCP-capable tools such as Claude Code, Cline, and Trae. It is also useful outside programming whenever the model you use has no native image recognition. You can pick whichever vision model you prefer from the ModelScope community and substitute it when filling in the MCP configuration.

📱 Daily use cases:

  • Send a screenshot and have the AI point out what went wrong.
  • Share an image link, or place a screenshot in the project folder, and have the AI help optimize the layout.
  • Send a product image link and have the AI write promotional copy.

Overview

What is Vision MCP Server?

Vision MCP Server is a Model Context Protocol (MCP) server designed for image analysis and description, addressing the visual recognition limitations of text-based models. It enables accurate image identification and description, making it ideal for AI-assisted design interface analysis.

How to use Vision MCP Server?

Drop an image link into the chat box, or place an image in the project folder, and ask the AI to analyze it. The server can be added to MCP-capable tools such as Claude Code, Cline, and Trae. To use a different vision model, pick one from the ModelScope community and set it as MODELSCOPE_MODEL in the server config.

Key features of Vision MCP Server

  • Supports local image files and online image URLs.
  • Intelligent image analysis based on ModelScope AI models.
  • Full compatibility with MCP protocol.
  • TypeScript support with complete type definitions.
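
The first feature implies the server has to tell a local file apart from an online URL before it can load the image. The listing does not show how vision-mcp-server does this, so the following is only a minimal sketch of one plausible approach. The `ImageSource` type and the `classifyImageSource` helper are hypothetical names introduced for illustration, not part of the package.

```typescript
// Hypothetical helper: NOT taken from the vision-mcp-server source.
// It only illustrates the "local file or online URL" distinction.
type ImageSource =
  | { kind: "url"; url: string }
  | { kind: "file"; path: string };

function classifyImageSource(input: string): ImageSource {
  const trimmed = input.trim();
  // Assumption: anything starting with http:// or https:// is an online image.
  if (/^https?:\/\//i.test(trimmed)) {
    return { kind: "url", url: trimmed };
  }
  // Assumption: everything else is a path inside the project folder.
  return { kind: "file", path: trimmed };
}

// Example inputs matching the two documented input methods.
const fromLink = classifyImageSource("https://example.com/mockup.png");
const fromFolder = classifyImageSource("./screenshots/error.png");
```

A discriminated union keeps the two cases explicit, so downstream code has to handle both the URL and the file path before it can read the image.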

Use cases of Vision MCP Server

  1. Identifying errors or issues by sending screenshots.
  2. Optimizing layouts by sharing image links or screenshots.
  3. Generating promotional copy from product images.

FAQ about Vision MCP Server

  • Can Vision MCP Server analyze all types of images?

It can analyze a wide range of images, provided the file is in a supported format.

  • Is there a cost to use Vision MCP Server?

No, it is free to use for everyone.

  • How accurate is the image analysis?

The accuracy depends on the quality of the image and the complexity of the content.

Server Config

{
  "mcpServers": {
    "vision-mcp-server": {
      "command": "npx",
      "args": [
        "-y",
        "vision-mcp-server"
      ],
      "env": {
        "MODELSCOPE_TOKEN": "your_modelscope_token_here",
        "MODELSCOPE_MODEL": "Qwen/Qwen3-VL-30B-A3B-Instruct"
      }
    }
  }
}
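
Since the description says the vision model can be swapped when filling in the configuration, here is a small sketch that generates the same JSON with a model of your choice. The field names and default values are copied from the config above. The `buildVisionConfig` function, the two interfaces, and the `your-org/your-vision-model` placeholder are assumptions made for illustration only.

```typescript
// Shape of the config shown above; the interface names are illustrative.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface McpConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Default model taken from the listing's server config.
const DEFAULT_MODEL = "Qwen/Qwen3-VL-30B-A3B-Instruct";

// Hypothetical helper: builds the config with a chosen ModelScope model.
function buildVisionConfig(token: string, model: string = DEFAULT_MODEL): McpConfig {
  if (token.trim() === "") {
    throw new Error("MODELSCOPE_TOKEN must not be empty");
  }
  return {
    mcpServers: {
      "vision-mcp-server": {
        command: "npx",
        args: ["-y", "vision-mcp-server"],
        env: { MODELSCOPE_TOKEN: token, MODELSCOPE_MODEL: model },
      },
    },
  };
}

// "your-org/your-vision-model" is a placeholder, not a real model ID.
const customConfig = buildVisionConfig(
  "your_modelscope_token_here",
  "your-org/your-vision-model",
);
const customConfigJson: string = JSON.stringify(customConfig, null, 2);
```

The resulting `customConfigJson` string can be pasted into the MCP settings of whichever client you use. Only the `MODELSCOPE_MODEL` value differs from the config above.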
© 2025 MCP.so. All rights reserved.
